Name Disambiguation Method Based on Multi-step Clustering
نویسندگان
چکیده
منابع مشابه
A Multi-stage Clustering Framework for Chinese Personal Name Disambiguation
This paper presents our systems for the participation of Chinese Personal Name Disambiguation task in the CIPSSIGHAN 2010. We submitted two different systems for this task, and both of them all achieve the best performance. This paper introduces the multi-stage clustering framework and some key techniques used in our systems, and demonstrates experimental results on evaluation data. Finally, we...
متن کاملClustering Technique in Multi-Document Personal Name Disambiguation
Focusing on multi-document personal name disambiguation, this paper develops an agglomerative clustering approach to resolving this problem. We start from an analysis of pointwise mutual information between feature and the ambiguous name, which brings about a novel weight computing method for feature in clustering. Then a trade-off measure between within-cluster compactness and among-cluster se...
متن کاملAn Improved Name Disambiguation Method Based on Atom Cluster
An improved name disambiguation method based on atom cluster. Aiming at the method of character-related properties of similarity based on information extraction depends on the character information, a new name disambiguation method is proposed, and improved k-means algorism for name disambiguation is proposed in this paper. The cluster analysis cluster is introduced to the name disambiguation p...
متن کاملA Term-Based Driven Clustering Approach for Name Disambiguation
Name disambiguation in databases is a non-trivial task because people’s names are often not unique and usually only a limited information is associated with each name in the database. For example, in DBLP many authors share the same name, whereas we do not have any unique identifier to distinguish them. To make it worst, we may not always be able to access the full contents of the materials, un...
متن کاملA Heuristic-based Hierarchical Clustering Method for Author Name Disambiguation in Digital Libraries
In this paper, we propose a heuristic-based hierarchical clustering (HHC) method to deal with the name disambiguation problem. The method successively fuses clusters of citations of compatible authors based on several heuristic and similarity measures on the components of the citations (e.g., coauthors, title of the work, publication venue). In each phase, the information of fused clusters is a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2016
ISSN: 1877-0509
DOI: 10.1016/j.procs.2016.04.237